Local discrimination of mixed states 
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We provide rigorous, efficiently computable and tight bounds on the average error probability of 
multiple-copy discrimination between qubit mixed states by Local Operations assisted with Classical 
Communication (LOCC). In contrast to the pure-state case, these experimentally feasible protocols 
perform strictly worse than the general collective ones. Our numerical results indicate that the gap 
between LOCC and collective error rates persists in the asymptotic limit. In order for LOCC and 
collective protocols to achieve the same accuracy, the former requires up to twice the number of 
copies of the latter. Our techniques can be used to bound the power of LOCC strategies in other 
similar settings, which is still one of the most elusive questions in quantum communication. 



Quantum communication and computation tasks in- 
volve, broadly speaking, transforming an input state and 
reading the corresponding output state. One of the most 
prominent features of quantum mechanics is that, hard 
as one may try, the readout will be unavoidably imper- 
fect unless the various output states are orthogonal. This 
has both fundamental and practical implications that lie 
at the heart of quantum mechanics and its applications. 
The most simple scenario where this deep fact manifest 
itself is what is known as quantum state discrimination 
or, more generically, quantum hypothesis testing In 
its simplest form, given one of two possible sources that 
provide N independent copies of a state po or p\ (i. e., 
CT o = Pq N or a i = pf N ), we ask ourselves what is in av- 
erage the minimum error probability P e of making a con- 
clusive guess of the identity of the source/state. Hereafter 
we will refer to this minimum average simply as the er- 
ror probability, and for simplicity we will further assume 
equal prior probabilities for p a , a = 0, 1. State discrimi- 
nation is an essential primitive for many quantum infor- 
mation tasks, such as quantum cryptography or even 
quantum algorithms [3j. Moreover, with the remarkable 
experimental advances in the preparation and measure- 
ment of quantum states it becomes essential to have a 
theory to assess the performance of state discrimination 
protocols. 

The error probability for two arbitrary states was given 
already decades ago by Helmstrom [J], who provided a 
formal expression for P e in terms of the trace distance 
between the two density matrices Co and a\ . An explicit 
simple expression can be given for single qubit states 
(N = 1), and only very recently the asymptotic error 
rate when N — > oo for qudits has been found through 
the quantum Chernoff bound 0, ||| . For finite (moder- 
ately large) number of qubits the permutation invariance 
of the multiple-copy states {c a }, of size ~ 2^, enables 
us to write them in block-diagonal form. Thus, we can 
numerically compute the error probability in terms of 
the trace-distance between small-sized (~ N) blocks, and 
the difficulty of the problem (required memory size) be- 
comes polynomial in N despite the exponentially grow- 
ing dimension of the Hilbcrt space we are dealing with. 



This collection of results constitute a fairly complete 
theory regarding the optimal (unconstrained/collective- 
measurement) multiple-copy discrimination of quantum 
states. 

The picture changes completely when it comes to dis- 
crimination of states using Local Operations (on indi- 
vidual copies) and Classical Communication (LOCC) in- 
stead of general (collective) measurements. This sce- 
nario is interesting from the fundamental point of view, 
as it sheds light on the role of quantum correlations in 
quantum information tasks, but it is of paramount in- 
terest from a practical point of view since it puts under 
scrutiny the attainability of previous bounds in imple- 
mentations, where collective measurements are usually 
unfeasible. For pure states it has been shown 0] that the 
minimum collective error probability can be attained by a 
LOCC one-way adaptive protocol consisting in perform- 
ing a different von Neumann measurement on each copy, 
where each measurement is chosen according to the out- 
come^) of the previous one(s). Furthermore, it is shown 
that an even simpler fixed local strategy, where the very 
same particular von Neumann measurement is repeated 
on every copy, though not optimal for finite N, does pro- 
vide the collective asymptotic error rate as N — ¥ oo. The 
latter also holds when only one of the states is pure @ . 

For mixed states, recently, Hayashi |8[ has proven that, 
as far as the asymptotic error rate is concerned, one- 
way adaptive strategies are not advantageous over fixed 
strategies, which do not make use of classical communi- 
cation between measurements. In the last months Hig- 
gins et al. [9] have studied theoretically and experimen- 
tally the performance of various local strategies including 
adaptive von Neumann measurements. These adaptive 
strategies, which are optimized using dynamic program- 
ming (DP) techniques [lOj, outperform the others un- 
der consideration, but there still remain the fundamental 
open questions of whether or not these strategies are the 
best one can achieve by LOCC (which include general- 
ized local measurements and unlimited communication 
rounds), and whether or not those can attain the collec- 
tive bounds. 

In this letter we show how to efficiently compute 
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bounds on LOCC protocols. This enables us to compare 
such protocols to their collective counterparts and bench- 
mark the performance of particular state discrimination 
strategies or experiments. Before presenting our analy- 
sis, let us note that there has been some recent interest in 
the related problem of LOCC discrimination of bipartite 
states (see ll| and references therein) , where each of the 
parties has joint access to a share of all copies. We find 
that in our scenario, although the states are disentan- 
gled, quantum correlations play an essential role, which 
is, in this sense, a manifestation of non-locality without 
entanglement. 

At this point we need to go into a more technical dis- 
cussion. The error probability for the case under consid- 
eration can be written as |j| 



no need to apply T to check PPT, the sizes of the matrices 
we have to deal with grow polynomially (quadratically) 
in N. 

Applying the procedure sketched above requires, 
nonetheless, finding an efficient parametrization of PT 
invariant matrices in block-diagonal form. The first step 
towards this end is identifying the independent matrix 
elements in the computational basis \i1_i2 ■ ■ ■ In)- For TV 
qubits, the operator E can be written as 
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where all i p , i' s (p,s = 1,2... N) are either or 1 and 
each sum runs over the 2 N possible binary lists (num- 
bers) of N digits. Invoking permutation invariance and 
hermiticity the independent components of E can be cho- 



sen to be £ix;;;ii 



11. ..10. ..011. ..10. ..00 
100. ..00. ..00 



ci' , where the first R 



The matrix E, together with 1 — E, are the elements 
of the Positive Operator Valued Measure (POVM) that 
represents mathematically the measuring protocol. Note 
that they can be taken to be symmetric under permuta- 
tions of the individual systems (invariant under the ac- 
tion of the symmetric group Sn), because so are <7o and 
o 1 , and can thus be put in block-diagonal form. 

We realize that Eq. (fTJ) defines a Semidefinite Program- 
ming (SDP) problem, for which very efficient numerical 
algorithms have been recently developed [ij} . Bounds on 
LOCC strategies could be obtained if in addition to the 
positivity constraint < E < 1 on the POVM elements 
E and 1 — E one would further impose, e. g., Positive Par- 
tial Transposition (PPT), i. e., < E T < 1 (recall that 
this condition defines the set of PPT-preserving opera- 
tions 



digits in the subscript are ones and the Q (Q 1 ) first dig- 
its on top of them (the remaining N — R zeros) are also 
ones. We further impose that R < Q + Q' and note 
that Q < R. We next use PT invariance to exchange the 
last R — Q ones in the subscript with the zeros on top 
of them by raising (lowering) the corresponding i p (i' p ). 
This proves that the PT invariant matrices we are dealing 
with have (N + 1)(N + 2)/2 independent components: 
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^11... 10. ..00. ..00 
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where q and r are the number of ones in the superscript 
and the subscript respectively. 

We next wish to write E in block-diagonal form. To 
this end, we map each qubit to a spin 1/2, \i p ) — > \m p ) — 
I (— 1)%>/2), where m p is the magnetic number of the p-th 
spin, and change from the uncoupled (computational) ba- 
13(, which includes the set ofseparable operations sis to the tota ] spin eigenbasis {\j, m) }£,=_,,•, which span 

the irreducible representations (irreps) of SU(2). In this 
basis E becomes block-diagonal and the matrix elements 
of each block Ev\ i. e., [-E^]™ = (j , m\E\j , m') , are ex- 
pressed as linear combinations of the independent param- 

eters £?. We write = £ r ? [M^] ™' g r £?, which 

facilitates the SDP implementation of the optimization. 
Some comments are in order. For given j and m, the 
state I j, m) is degenerate. Note, though, that all blocks 
with the same j are identical, as E is fully symmetric. 
Therefore, the contribution of E^> to the error probabil- 
ity will have to be multiplied by the corresponding de- 
generacy, nj = ( N/ N 2 _ ] )(2j + l)/(JV/2 + j + 1). The ma- 
trices turn out to be (see supplementary material) 



and, in turn, LOCC operations). The difficulty here is 
that Partial Transposition (PT), which we denote by the 
superscript T, does not preserve the block-diagonal form 
of E (just note that T breaks permutation invariance). 
Hence, the size of the matrices (in particular, E T ) one 
needs to deal with remains ~ 2^ and the error probabil- 
ity cannot be computed but for very small values of N. 
We next show how to go around this problem. 

The main observation is that for (two) qubit-state dis- 
crimination the POVM element E can always be cho- 
sen to be PT invariant, E T — E (for any bipartite split 
of the N qubits). This follows from the fact that with 
the appropriate choice of basis, one can always cancel all 
phases in po and pi simultaneously, so that they have only 
real elements and are, therefore, symmetric: pj = p a . 
Obviously, this implies PT invariance, cr£ = cr a , for any 
bipartite split. Hence, for a given E that satisfies PPT, 
the PT invariant operator E' = (E + E r )/2, which also 
satisfies < E' = E' T < 1, provides the exact same er- 
ror probability P e . Therefore, we can restrict ourselves 
to PT invariant operators without any loss of generality. 
Since E can be put in block-diagonal form, and there is 
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In the above expressions the sums run over all integer 
values for which the factorials make sense. We note in 
passing that the very same coefficient (I2U1) appears in 
the expression of the Wigner d-matrices, which give the 
different irreps of a rotation about the y axis (see sup- 
plementary material). 

Now that we have a minimal parameterization of the 
operators that are invariant under permutations and 
partial-transpostions we can compute the error proba- 
bility by the following SDP instance: 

n PP ^{l + mm$>,tr [{W-<&)E&\ }. (6) 



Here the minimization is constrained by < E^) < t, 
for all possible values of the total spin j, and the matrix 
blocks <Ja^ are computed to be 
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FIG. 1: (color online) LOCC lower-bound P e PPT (solid 
line) and error probability for collective (squares), adaptive 
(crosses) and repeated (dashed) strategies for ro = n = .8 
and 6 = w/2. Inset: Gap [Eq. ©] vs. N for 6 = tt/2, r = .8 
and r\ = 1 (solid), n = .9 (dotted), n = .5 (dotted- 
dashed), n = .4 (dashed). 
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where r a , often referred to as purity or degree of mixed- 
ness, is the length of the Bloch vector r a of the single 
qubit state p a , and 9 is the relative angle between r*o 
and f\. 

As argued above, P p PT provides a lower bound to the 
error probability attainable by the most general LOCC 
strategy, which includes weak generalized local measure- 
ments interlaced with an unlimited number of classical 
communication rounds. In what follows we will com- 
pare this bound to the error probability of the optimal 
collective strategy P™ 1 and to that of two LOCC strate- 
gies that use rank one projective measurements on the 
individual copies. More precisely: i) Repeated strategy, 
where the very same two-outcome measurement is per- 
formed on every copy. The error probability, Pg Cp , is 
obtained by minimizing over the azimuthal angle O that 
specifies the unit Bloch vector of the two measurement 
projectors. The corresponding asymptotic error rate can 
also be obtained from the classical Chernoff bound [fj. 
ii) Adaptive strategy, where copies are measured sequen- 
tially and the choice of the azimuthal angle O s , corre- 
sponding to the measurement on the s-th copy, depends 
on the outcomes obtained upon measuring the preced- 
ing s — 1 copies, i.e., it makes use of one-way communi- 
cation. If the number of available copies, TV, is known 
beforehand it is possible to find the optimal adaptive 
strategy very efficiently using DP as detailed in the 
supplementary material. 

Figure [1] shows the error probability of discrimina- 
tion between two states with equal purity (ro = r% = .8) 
and 9 — tt/2 for the various strategies/bound dis- 
cussed above. We notice that there is a significant gap 



between the collective strategy and the LOCC lower- 
bound (P e PPT ). In addition we note that the error proba- 
bility for repeated and adaptive strategies fall almost on 
top of the LOCC lower-bound. This shows that Pj PT is 
a very tight bound and that it can be taken as a good 
estimate of the minimal LOCC error probability for most 
practical purposes. The figure clearly shows that, as ex- 
pected, the error probability falls exponentially with N: 
P e ~ e~ CN . By fitting the data in the figure to an 
error rate of the form C = C + d log N/N + C 2 /N 
for 25 < N < 35, we can obtain its asymptotic value Cq 
for the various strategies. For the collective and repeated 
strategies the results agree up to the third significant 
digit with the analytical results provided by the quan- 
tum and classical Chernoff bounds respectively |6j . More 
interestingly, within numerical accuracy the fits indicate 
that the gap between collective and LOCC error rates 
persists in the asymptotic limit. The results are also con- 
sistent with a convergence of the asymptotic error rates 
for the two LOCC strategies and that of the LOCC lower- 
bound (P e PPT ), although here, due to the already small 
differences, it is harder to exclude the existence of a (tiny) 
non- vanishing gap. 

In the inset of Figure [1] we plot the gap between 
the collective error rate and that of the LOCC lower- 
bound (P e PPT ), i. e.: 



A = C 



col 



i pcol 

^ PPT = ->g^. (8) 



We notice that the gap reaches its asymptotic value al- 
ready for a small number of copies. This is so for all 
values of r\ but for r% = 1 (solid line), for which A, after 
growing to a maximun at N 4, decreases to zero as 
it should, according to Ref. @. There, as mentioned at 
the beginning of this letter, it is shown that when one 
of the states, say pi, is pure the collective error rate is 
asymptotically attainable by a repeated strategy. More 
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precisely, by one consisting in performing the measure- 
ment defined by E = p\ on each copy. The unknown 
state is claimed to be p\ if the N outcomes of the mea- 
surements correspond to E (none to 1 — E), and it is 
claimed to be po otherwise (unanimity vote). The asymp- 
totic error rate of this strategy attains the upper-bound 
Co < — logF(p , Pi), where F(p ,pj) is the fidelity, de- 
fined as F(p , pi) = (tr | ^/po^/pi |) 2 [6j. For the collective 
strategy it also holds that -(1/2) logF(p , Pi) < Cg o1 . 

Figure [5] shows the error rate C = — (l/A)logP e , 
for two equally mixed states, = n/2 and N = 25, 
as one varies their degree of mixedness r. We identify 
four parameter regions in this plot: i) For very mixed 
states (r < .5) collective and repeated local strategies 
have essentially the same performance, ii) As the pu- 
rity increases (.5 < r < .8), the collective strategy starts 
to outperform the LOCC one, but the repeated strat- 
egy nearly attains the LOCC lower-bound -P e PPT (upper- 
bound on the error rate). The measurements of this re- 
peated strategy have 9 = 7r/2, which means that their 



Bloch vector is proportional to r*o — r\ [14J . The decision 
is taken by 11 'majority vote" , i.e., the most frequent out- 
come determines the decision, iii) At very high purities 
(•8 < r < r*) the three LOCC curves start to split, iv) At 
purities larger than a critical one, r* , all LOCC curves 
start to rapidly converge to the collective one. At r = r* 
the measurement angle of the repeated strategy starts 
to change from 9 = 7r/2 towards 9 = 7r/4 as r — > 1 
(for arbitrary 9, one has 9 — > 9/2, i.e., the Bloch vec- 
tor of the measurements goes to either r*o or fi) and 
the decision rule gradually shifts from a majority vote to 
the unanimity vote described above. In the asymptotic 
limit N — >■ oo, r* can be computed to arbitrary accuracy 
as a solution of a transcendental equation; for 9 = ir/2 
one has r* w .9819. For r < r* the error rate of the re- 
peated strategy, Cq CP , saturates the fidelity lower-bound 
introduced above: C ep = — (1/2) log F(po, pi). 

As mentioned in the introductory part, for asymp- 
totically large N repeated and one-way adaptive error 
rates coincide [8]. The tiny gap between the correspond- 
ing (dotted) curve and the crosses in Fig. [5] is explained 
by the relatively small number of copies (N = 25) used 
in the plot. It is plausible, and consistent with our data, 
that also the gap between the LOCC error rate bound 
(dashed line) and Cq 6P vanishes as N —> oo. 

Note that the smaller the error rate, the more copies 
we need to achieve the same error probability. The ra- 
tio / = (7 c °i/(7 ppT tells us that we need fN copies in or- 
der for the best LOCC strategy to discriminate with the 
accuracy of the collective one. The general features of / 
as a function of r can be immediately grasped from Fig. [2] 
For very mixed states LOCC and collective strategies re- 
quire a similar number of copies (/ rs 1) to discriminate 
with the same error probability, but as the states become 
more pure the LOCC strategies demand up to twice this 
number (i. e., / < 2, which asymptotically is the ratio of 




the two fidelity bounds) . In the limit of very pure states 
(r > r* ) the factor / drops back down again to one. 

0.8 r 



0.6 
C 0.4 
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FIG. 2: (b) Error rate vs. r = r = n for N = 25 and = tt/2, 
and for collective (solid), LOCC (PPT bound) (dashed), and 
repeated (dotted) strategies. Inset shows in addition the error 
rate for the one-way adaptive strategy (crosses). 

In summary, we have lower-bounded the error proba- 
bility of LOCC discrimination between two qubit mixed 
states. To do so, we have characterized all permutation- 
and PT-invariant operators, which may find applica- 
tion in other quantum communication problems involv- 
ing LOCC. Our results indicate an error rate gap between 
the best LOCC and collective discrimination protocols 
that persists as the number of copies goes to infinity. This 
gap takes its largest value in the region of nearly pure, 
but strictly mixed, states. Excluding this region, there 
are no significant differences in performance between the 
simplest (repeated) and optimal LOCC strategies. 
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SUPPLEMENTARY MATERIAL 

Section I of this supplementary material contains de- 
tails on the derivation of the block-diagonal form of per- 
mutation and PT invariant operators on (C 2 )® N . Sec- 
tion II provides an example of dynamic programming 
applied to state discrimination with one-way LOCC mea- 
suremens 



I. Permutation and PT invariant operators in 
block-diagonal form 

Let us consider the permutation invariant operator 



' |iiz 2 - - - *at) (*i*2 • ■ • *jvK (9) 



where {|ix*2 ■ • ■ *jv)} is the (qubit) computational basis 
and each sum runs over the 2 N possible sequences of N 
bits. Permutation invariance implies that E can be writ- 
ten in block-diagonal form (the blocks corresponding to 
irreducible tensors) by an appropriate change of basis. 
This is more familiarly done in the context of spin and / or 
angular momentum, so we map each qubit to a spin 1/2, 



\i P ) -+ K> = |(— 1)^/2), 



(10) 



where m p is the magnetic number of the p-th spin. 
In this, more physical, picture block-diagonalization is 
achieved by the total spin (J) eigenbasis {\j, m, a)} 3 m __j. 
The index a labels the bases that span the various equiv- 
alent irreducible representations (irreps) of SU(2), each 
corresponding to a particular way of coupling the indi- 
vidual spins to give the same total spin j. Permuta- 
tion invariance also implies that the projections of E on 
these irreducible subspaces, span({| j, m, a)} J m= _j), are 
identical for all values of a (they depend only on j). 
Hence, from hereafter, we drop this index and, e. g., 
write [BW]™ = (j,m\E\j,m') for the matrix elements 
of the blocks. It follows that, 



where we implicitly use (|10|) and the sum runs over all 
possible configurations of the magnetic numbers such 
that 



N 



N 
s=l 



(12) 



Since all the matrix blocks with the same j are identi- 
cal, to compute the values of the (generalized) Clebsch- 
Gordan coefficients {mxm% . . . mpf\j, m) in (jllj) it suffices 
to couple the individual spins in the way that leads to the 
simplest calculation. For a given j, it proved useful to 
pair in singlets the N — 2j spins/qubits that are at the 
beginning of the sequence and couple the remaining 2j 
to give total spin j. Thus, we take the state |j, m) to be 
of the form 



\j,m) 



\i>- 



v®(iV/2-j) 



\j, m) 



(13) 



where |V>-) is the spin singlet \\ip_) = (|01) - |10))/V2 
in binary notation] and \j, m) sym is the fully symmetric 
state of 2j spins with total spin j. With this choice, 
the only non-vanishing Clebsch-Gordan coefficients are 
those for which the first N — 2j spins in (miffi2 . . . 
consist of I pairs of the form (i — 1| = (a\ and p pairs 
of the form (~ || = (01 = a and 10 = p in bi- 
nary notation) with I + p = N/2 — j. In this case, and 
further assuming that conditions (fT2"|) hold, a straightfor- 
ward calculation yields 



(mim 2 . ..TOjv|j,m) = (- 



N 

2— •» 



2j 

j + m, 



(14) 



where we have used that the fully symmetric 
state \ j, m) sym is an equal superposition of 



■+m) vec tors 



of the form 

We can use again permutation invariance to cast the 
independent components of E into the form 




j^a...a fj,. 

^ 0...0 <j...<j 11. ..1 1...100.. 



l+p 



.00 _ el,p;Q,Q' 
.00 — C R ' 



(15) 



where I, p, R, Q and Q' stand for the number of cr-pairs, 
/i-pairs and ones indicated by the braces. For a fixed 
choice of I and p, the unpaired bits are arranged so that 
in the subscript sequence (the row index of E) no zero 
precedes a one. The first R bits on the right of the /i-pairs 
in the superscript sequence (column index) are arranged 
along the same pattern, and so are the bits placed on top 
of the N — I — p — R zeroes of the subscript sequence. 
Note that Q < R and 
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0,0;Q+Z,Q'+p 
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Q+l,Q'+p 
R+l+P 



(16) 



It is a straightforward exercise in combinatorics to com- 
(11) pute the number of matrix elements of E whose value 
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is £ 



i-,p;Q,Q' 
R 



2 i+ P 



, It is given by 
p\fN-2l-2p\fR\ (N-2l-2p-R 



I 



R 



Q 



Q' 



(17) 



The first factor takes into account that by applying a 
permutation (thus leaving E unaltered) we can replace 
any of the I +p c-pairs in the subscript by a /i-pair (with 
an exchange p, -O- a at the exact same position of the 
superscript sequence). The second factor is the number 
of ways the I er-pairs (and the p /i-pairs) can be placed in 
the first I + p positions of the superscript sequence. The 
third factor gives the different ways the R ones in the 
subscript sequence can be placed in N — 21 — 2p positions 
(with a corresponding repositioning of the bits right on 
top of them). Similarly, the forth (fifth) factor is the 
number of ways Q (Q') ones can be placed on top of 
the R ones (N — 21 — 2p — R zeroes) of the subscript 
sequence. 

With the Clebsch-Gordan coefficients (TT4")) . the nota- 
tion introduced in (fT5|) . and the combinatorics in (TlTl) . 
we can give an explicit expression for the matrix-blocks 
EW as 



JY 



— -3 

— "X h ~?T ~~j k.j — rti'—k 



1=0 k=0 

x"-f7%^e-")o^-*)- <i8 » 

where we have used that p — N/2 — j — l, R = j — m, Q' = 
j — m! — Q, and finally replaced the dummy index Q by 
k. Using Eq. (|16[) and making the changes k — > j — m — k 
and I —> NJ 2— j — I, this last expression reduces to 



i 



K _ A 

2 J 
I 



N 

■ ~2 —m—k—l,m—m +k+l 
N 
2 



x (-l)'^ 2 '" '" , (19) 



where we have defined 



r A 0)l m ' = V(j-rn)\(j + m)\(j-m')\{j + m')\ 

1 k im (j-m-k)\(j+m'-k)\{m-m'+ky.kV [ ' 

and the sums run over all integer values for which the 
factorials make sense. 

Partial transposition invariance implies that 



Q,Q' _ gO,Q'+R _ f-Q'+R 



Q 



(21) 



blocks E^\ 

[e&]£ = E[ A * } ]- E 



A- 



I 



N 



~2 — m—K—l 



(22) 



from which Eq. (4) in the main text follows immediately. 

Eqs. (TT91 provides a useful parameterization of gen- 
eral permutation invariant operators. As an example, we 
compute the block decomposition of the product state 
a = p® N , which is also used in the main text [Eq. (7)]. 
The independent components of er are easily found us- 
ing (|15l) [here we do not use superscripts; we write pij 
(i, j = 0, 1) instead of p? for the matrix elements of p] 



cQ,Q' _ Q R-Q Q' N—R—Q' 

C R — PllPlO PoiPoO 



(23) 



Substituting in Eq. (|19|) we obtain 



, / N 

[<7«]™ = E^fe^m E 2 



/ k m — ni'+k j—m—k j-\-m' — k 

x (- 1 ) PioPoi Pit Poo 



N 



x (PioPoi) (pnPoo) 2 



-3 -I 



(24) 



Note that the sum over I can be readily performed to give 
(pooPn ~ PwPoi) N/2 - j = (detp) N / 2 -^ and 



= (detp)^5>fc] 



(i)im' 



v n k n m-m'+k J—m—k J+m'-k 
x PioPoi Pu Poo 



(25) 



Eq. (7) in the main text follows from the Bloch parame- 
terization of a rebit used there 



_ 1 + rcos(6»/2) 1 - rcos(0/2) 

POO — n , Pll — ^ s 



P01 = PW = 



rsin(0/2) 



(26) 



Finally, Eq. (TT91 can also be used to provide an alter- 
native derivation of the well-known Wigner d-matriccs. 
Consider the block decomposition of the product repre- 
sentation N of the SU (2) "rotation" about the y-axis: 

A P A $ A A ■ P 

d 00 = cos-, dn=cos-, doi = -aio = -sm— . (27) 
Proceeding as above [Eqs. (f2"3"f to (|25p] one obtains 

fowls' - D^C (-1)-*'** 



m—m -\-2k 



(28) 



and we can further simplify the expression of the matrix which are the standard Wigner d-matrices. 
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II. One-way LOCC State discrimination using 
dynamic programming 

Dynamic Programming (DP) is a powerful numerical 
technique with a wide range of applicability that, roughly 
speaking, consists in casting an optimization problem 
into a recursive form. It can be applied to problems where 
decisions that involve smaller-sized sub-problems are 
taken at different stages of the optimization process. Its 
main advantage is that it avoids exploring those deci- 
sions that cannot possibly be optimal. Here, we present 
a DP formulation of multiple-copy discrimination among 
M possible states {pk}kLi using a local one-way adap- 
tive strategy. It is a slight generalization of the problem 
considered in the main text, for which M = 2. 

Let us assume that we are given N identical copies 
of a state that is known to belong to the set {pk}fc=i, 
and let n(k) be the prior probability (prior for brevity) 
that the given state be pk\ for convenience we write 7T = 
{ 7r (' c )}fcli- We view 7r as a free variable, which can take 
arbitrary values in [0,1] M . The discrimination protocol 
we wish to optimize consists in measuring each copy se- 
quentially, and returning the alleged identity of the given 
state after completion of the whole sequence, once the 
iV-th copy has been measured. When an outcome is ob- 
tained, we consequently update the priors and perform a 
new measurement on the next copy. This new measure- 
ment is specifically chosen in accordance with the just 
updated priors. 

For simplicity, let us here assume that the POVM oper- 
ators that characterize the local measurements have the 
form {E x ((p)}^ =1 , i. e., they can be parameterized by 
a set of variables which we have arranged into the vec- 
tor cp, and W is the number of possible outcomes. If the 
unknown state is pk the (conditioned) probability of ob- 
taining the outcome x when performing the measurement 
on a copy is given by the Born rule 

p(x\k;(p)=\x[E x (<p)p k ], (29) 

where a semicolon will always be used to separate random 
variables from any other argument the various probabil- 
ities may depend upon. As mentioned above, on each 
copy we allow for a specific measurement depending on 
the current value of the priors. Our aim is to optimize 
the function <p(ir) at each step of the measurement se- 
quence so that the success probability of discrimination 
after the iV-th (last) measurement is as large as possi- 
ble. We will denote the optimal choices by <£* (tt), where 
n = 1,2, ... ,N refers to the order in the sequence and 
the asterisk (*) stands for "optimal". In order to do so 
in a recursive way, we still need to introduce a further 
probability function, which we denote by S*_i(tv). It is 
defined to be the maximum average success probability 
if we drop the first n — 1 copies, namely, if the sequence 
of measurements starts at copy n, and assuming that 
the priors are 7r. Again, the asterisk reminds us that 



this sequence of N — n + 1 measurements is optimal for 
the given priors. Notice that all information about the 
identity of the unknown state that we would gain by per- 
forming measurements on the first n — 1 copies is encoded 
in the priors 7r (i. e., in the posterior probabilities if the 
measurements on the first n — 1 copies were performed) . 
Notice also that the success probability we wish to com- 
pute is given by S(j(n), and that 

S* n (tz) = max{7r(fc)}. (30) 

fe 

In words, this equation states that after performing the 
last measurement, the optimal decision is to identify the 
unknown state as being the state in {pu}kLi with maxi- 
mum posterior probability (Bayes decision). 

We next establish a recursion relation that will give 
us S*_ 1 (tt) once we know S^(tt). After performing the 
measurement on the n-th copy, and assuming we have 
obtained outcome x> the well known Bayes rule enables 
us to update the priors as tv — > iv' (\; tt , <p) , with k- 
components given by: 

T (« X 7T, <P) = 7 \ * (31) 

Here the marginal probability in the denominator (the 
probability of obtaining outcome x) is 

=^2p(x\k;v>)7r(k), (32) 
fe 

and we explicitly show all the dependencies involved in 
the updating. The success probability for this sequence 
of measurement starting with the n-th copy satisfies 

S n -i(ir,<p) = ^p(x;7r,¥5)5,*(7r'(x;7r,y>)); (33) 

x 

each term in this sum is the (joint) probability of obtain- 
ing outcome x m the n-th measurement and succeed- 
ing at the end of the discrimination protocol. Hence, 
this sum gives the success probability S n -i(n, ip). Note, 
however, that it still depends on our measurement choice 
through ip and optimality demands that we maximize 
this expression over all possible values of these parame- 
ters, i. e., 

S£_ 1 (7r)=maxS , n _i(7r, <p) =S n -i(n, <pl(nj) , (34) 
v 

where the last equation defines V 3 n( 7r ) as the choice of 
parameters that maximize S n -i(Tr,tp). Eqs. (|3"Tj) to (|34|) 
enable us to recursively compute the success probabil- 
ity Sq(tt) starting with the initial condition (f3"U|). This 
method is efficient as only the values of S*(ir) are needed 
to obtain S*_i(ir) and these can be stored at each step 
of the optimization. 

We should point out here that in general the opti- 
mal success probability S* (w) and measurement param- 
eter functions cp* (tt) can only be computed numerically, 
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which requires discretizing [0, 1] M with a fairly large set 
of lattice points and using interpolation methods. For 
two qubits discrimination with von Neumann measure- 
ments <p is one-dimentional, i. e., it is only the azimuthal 
angle on the Bloch sphere, and so is tc, as 7r(l) = 1 — 7r(0), 
and DP is very efficient. In this simple case and with 
equal priors the error probability is 

P e = l-S *(l/2). (35) 

We have used a one dimensional lattice of 20,000 points 
to compute our results and checked that they are robust 
against changes of the number of lattice points. 

Having explained the DP technique, one may wonder 
whether or not this is indeed the most general one-way 
adaptive local strategy. To show that this is so, we for- 
mally iterate Eqs. pip to (|34| and obtain 

Sq(tv) = max) maxN •••max N max ) 

Xi X2 XN-i Xn 

x maxj^fc) ]jKXn|fc;v n )| • (36) 
The argument of max/j in (f36|) is p({Xn}, k; n, {<£„}), 



the joint probability of obtaining a sequence of out- 
comes {Xn}n=i ({Xn} in short, and similarly for {<£„}), 
and the unknown state being pk- The maximization 
over k corresponds to the Bayes decision: given an out- 
come, or sequence of outcomes, decide for the state pk 
that gives that outcome with highest probability. The 
sum over all outcomes gives the total (or average) suc- 
cess probability. Note that the maximization over mea- 
surement parameters is performed in a specific order. 
Because of this, the choice of tp 1 is independent of the 
outcomes However, since the sum over \i pre- 

cedes max Va , we can make the second measurement de- 
pend on the outcome of the first one, i.e. <p 2 (xi)- in 
general, <p n ({Xm}]!n=i) (* ne measurement depends 
on the outcomes obtained in the previous ones). Hence 
we find that, indeed, the DP procedure returns the opti- 
mal one-way LOCC strategy (within the family of local 
POVM's parametrized by tp). 

Note that if we were to follow a brute force method 
and make direct use of (|36[) . we would be bound to op- 
timize over an impractically large number of parame- 
ters ~ exp N , instead of the ~ N optimization parame- 
ters present in the equivalent DP formulation. 



